Distance Guided Classification with Gene Expression Programming
Abstract
Gene Expression Programming (GEP) aims to discover the essential rules hidden in observed data and to express them mathematically. GEP has proven to be a powerful tool for constructing efficient classifiers. Traditional GEP classifiers ignore the distribution of samples, which reduces their efficiency and accuracy. The contributions of this paper are: (1) two strategies for generating the classification threshold dynamically, (2) a new approach called the Distance Guided Evolution Algorithm (DGEA) that improves the efficiency of GEP, and (3) extensive experiments demonstrating the effectiveness of dynamic threshold generation and of DGEA. The results show that the new methods reduce the number of evolutionary generations by 83% to 90% and increase accuracy by 20% compared with the traditional approach.
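The abstract does not spell out the two threshold strategies, so the following is only an illustrative sketch of the general idea of a data-driven threshold, not the paper's method. It assumes a binary problem in which an evolved expression produces one numeric output per sample, and it contrasts a fixed threshold of 0 with a hypothetical rule that places the threshold midway between the two class means. The function names and the toy numbers are invented for illustration.

```python
from statistics import mean


def classify(outputs, threshold):
    """Label each expression output 1 if it is at or above the threshold, else 0."""
    return [1 if y >= threshold else 0 for y in outputs]


def midpoint_threshold(outputs, labels):
    """Hypothetical dynamic rule (an assumption, not taken from the paper):
    the midpoint between the mean output of class 1 and the mean output of class 0."""
    pos = [y for y, c in zip(outputs, labels) if c == 1]
    neg = [y for y, c in zip(outputs, labels) if c == 0]
    return (mean(pos) + mean(neg)) / 2.0


def accuracy(predicted, labels):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(p == c for p, c in zip(predicted, labels)) / len(labels)


# Made-up outputs of some evolved expression on six training samples.
outputs = [2.1, 2.4, 2.9, 4.8, 5.2, 5.5]
labels = [0, 0, 0, 1, 1, 1]

# A fixed threshold of 0 ignores where the outputs actually lie.
fixed_acc = accuracy(classify(outputs, 0.0), labels)

# A threshold derived from the sample distribution separates the two groups.
dyn_t = midpoint_threshold(outputs, labels)
dyn_acc = accuracy(classify(outputs, dyn_t), labels)
```

On this toy data every output is positive, so the fixed threshold assigns all samples to one class, while the distribution-based threshold falls between the two groups. An expression that a fixed threshold would score poorly can therefore already be a usable classifier, which is one plausible reason a dynamic threshold could shorten the evolutionary search.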
Similar resources
Clustering of gene expression data using random forest dissimilarity
Background: The clustering of gene expression data plays an important role in the diagnosis and treatment of cancer. These kinds of data typically involve a large number of variables (genes) in comparison with the number of samples (patients). Many clustering methods have been built on the dissimilarity among observations, which is calculated by a distance function. As increa...
Forecasting copper price using gene expression programming
Forecasting the prices of metals is important in many aspects of economics. Metal prices are also vital variables in financial models for revenue evaluation, which forms the basis of an effective payment regime using resource policymakers. Owing to the severe changes in metal prices in recent years, the classic estimation methods cannot correctly estimate the volatility. In order to...
Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
DNA microarray gene expression data provide a wealth of information on thousands of variables (genes). Analysis of this information can reveal the genetic causes of disease and the differences between tumors. In this study we try to reduce the high-dimensional data by a statistical method, selecting valuable high-impact genes as biomarkers, and then classify ovarian tumors based on gene expression data of...
Presenting a new equation for estimation of daily coefficient of evaporation pan using Gene Expression Programming and comparing it with experimental methods (Case Study: Birjand Plain)
One of the most important components of water management on farms is estimating the exact amount of crop evapotranspiration (water requirement). The FAO-Penman-Monteith (FPM) method is a standard method for evaluating other techniques used for easy calculation of potential evapotranspiration when lysimeter datasheets are not available. This study was carried out based on 18 years’ climatic dat...
Prediction of Blasting Cost in Limestone Mines Using Gene Expression Programming Model and Artificial Neural Networks
The use of blasting cost (BC) prediction to achieve optimal fragmentation is necessary in order to control the adverse consequences of blasting such as fly rock, ground vibration, and air blast in open-pit mines. In this research work, BC is predicted through collecting 146 blasting data from six limestone mines in Iran using the artificial neural networks (ANNs), gene expression programming (G...